Relative Variational Intrinsic Control

نویسندگان

چکیده

In the absence of external rewards, agents can still learn useful behaviors by identifying and mastering a set diverse skills within their environment. Existing skill learning methods use mutual information objectives to incentivize each be distinguishable from rest. However, if care is not taken constrain ways in which are diverse, trivially sets arise. To ensure diversity, we propose novel objective, Relative Variational Intrinsic Control (RVIC), incentivizes that how they change agent's relationship its The resulting tiles space affordances available agent. We qualitatively analyze on multiple environments show RVIC more than discovered existing hierarchical reinforcement learning.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Variational Intrinsic Control

We introduce a new unsupervised reinforcement learning method for discovering the set of intrinsic options available to an agent. This set is learned by maximizing the number of different states an agent can reliably reach, as measured by the mutual information between the set of options and option termination states. To this end, we instantiate two policy gradient based algorithms, one that cr...

متن کامل

Relative Information Based Distributed Control for Intrinsic Formations of Reduced Attitudes Relative Information Based Distributed Control for Intrinsic Formations of Reduced Attitudes

This dissertation concerns the formation problems for multiple reduced attitudes, which are extensively utilized in many pointing applications and under-actuated scenarios for attitude maneuvers. In contrast to most existing methodologies on formation control, the proposed method does not need to contain any formation errors in the protocol. Instead, the constructed formation is attributed to g...

متن کامل

On Variational Expressions for Quantum Relative Entropies

Distance measures between quantum states like the trace distance and the fidelity can naturally be defined by optimizing a classical distance measure over all measurement statistics that can be obtained from the respective quantum states. In contrast, Petz showed that the measured relative entropy, defined as a maximization of the Kullback-Leibler divergence over projective measurement statisti...

متن کامل

Relative local variational principles for subadditive potentials

We prove two relative local variational principles of topological pressure functions P (T,F ,U , y) and P (T,F ,U|Y ) for a given factor map π, an open cover U and a subadditive sequence of real-valued continuous functions F . By proving the upper semi-continuity and affinity of the entropy maps h{·}(T,U | Y ) and h+{·}(T,U | Y ) on the space of all invariant Borel probability measures, we show...

متن کامل

Variational Principle for Relative Tail Pressure

We introduce the relative tail pressure to establish a variational principle for continuous bundle random dynamical systems. We also show that the relative tail pressure is conserved by the principal extension.

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence

سال: 2021

ISSN: ['2159-5399', '2374-3468']

DOI: https://doi.org/10.1609/aaai.v35i8.16832